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Abstract. We describe the construction of the Brera Multi-scale Wavelet (BMW) HRI Cluster Survey, a deep 
sample of serendipitous X-ray selected clusters of galaxies based on the ROSAT HRI archive. This is the first 
cluster catalog exploiting the high angular resolution of this instrument. Cluster candidates are selected on the 
, basis of their X-ray extension only, a parameter which is well measured by the BMW wavelet detection algorithm. 

^ ■ The survey includes 154 candidates over a total solid angle of ~ 160 deg^ at 10~^^ erg s~^ cm~'^ and ~ 80 deg'^ at 

1.8x10"^^ erg s~^ cm~'^. At the same time, a fairly good sky coverage in the faintest flux bins (3 — 5x10"^* erg 
CO . s~^ cm~^) gives this survey the capability of detecting a few clusters with z ~ 1 — 1.2, depending on evolution. 

We present the results of extensive Monte Carlo simulations, providing a complete statistical characterization of 
the survey selection function and contamination level. We also present a new estimate of the surface density of 
^— ■ clusters of galaxies down to a flux of 3xl0~^* erg s~^ cm~^, which is consistent with previous measurements 

from PSPC-based samples. Several clusters with redshifts up to z = 0.92 have already been confirmed, either 
by cross-correlation with existing PSPC surveys or from early results of an ongoing follow-up campaign. Overall, 
■ these results indicate that the excellent HRI PSF (5" FWHM on axis) more than compensates for the negative 

Oh' effect of the higher instrumental background on the detection of high-redshift clusters. In addition, it allows us 

^ , to detect compact clusters that could be lost at lower resolution, thus potentially providing an important new 

S_j ' insight into cluster evolution. 

. Key words. X-rays: galaxies: clusters survey 
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1. Introduction Lx and temperature Tx show fairly tight relations with 

^ , the cluster mass (e.g. Evrard et al. 1996; Allen et al. 2001; 

. .C^ , Clusters Of galaxies represent the largest collapsed objects j^^^p^.^^ ^ Bohringer 2002; Ettori et al. 2004). A full com- 

in the hierarchy of cosmic structures, resulting from the ■ c -u^ i- t ^■ 

, . . ' . ^ . prehension ot these scaling relations requires more mgre- 

growth of fluctuations lying on the high-density tail of the j- ; .i • i i i- j • j-i 4. 

/ \ dients than a simple heating during the growth 01 nuctua- 

matter density field (e.g. Peacock 1999) As such, their ^.^^^ ^^^.^^^ ^ggg. ^^^^^^^ ^ p^^^^^ 2000; Finoguenov 

number density and evolution are strongly dependent on , , r,nni td • ^ 1 r,nnA\ tj 4-1, • • i 

, 1 , 1 r 6t al. 2001; Borgani et al. 2004). However, their existence 

the normalization of the power spectrum and the value of j 1 i- 1 ^ • i ^ -c 1 u 

. ^ / T-> ■ ^ f,r,n relative tightness give clusters a specific role as probes 

the density parameter Hm (e.g. Borgani & Guzzo 2001; r 1 • 1 j 1 -j- -iu 4 

, iij V b to 'of the cosmoiogical model, providing us with a way to 

Rosati et al. 2002). In addition, clusters are usually con- ^^^^ ^^.^^^ ^.^^^^^^ ^^^^^.^^ ^^ ^^ Bohringer et al. 

sidered as "simple" systems, where the physics involved in 2002; Pierpaoli et al. 2003) and the mass power spectrum 
tmmng "mass into Hght" is possibly easier to understand, (gchuecker et al. 2003), via respectively the observed clus- 

compared to the various complex processes connected to , v 1 ■ r i- rvT T^^ j i ^ ■ t 

^ . ... ter X-ray luminosity function (XLh ) and clustering, in 

star formation and evolution in galaxies. In particular in jj-,- j n • ^ ^ 1 ^ ^ j-a- ^ j 

^ addition, and equally important, clusters at dinerent red- 

the X-ray band, where clusters can be defined and rec- , .n, -j u 1 r j.- n 1 

. shifts provide homogeneous samples ot essentially coeval 

ognized as single objects (not iust as a mere collection , . . 1 • u j -j. • i j • 

" o J \ J galaxies m a high-density environment and give the possi- 

of galaxies), observable quantities like X-ray luminosity ^.j.^^ studying the evolution of stellar populations (e.g. 

Send offprint requests to: moretti@merate.mi.astro.it Blakeslee et al. 2003; Lidman et al. 2004). 

* Partially based on observations taken at ESQ and TNG X-ray based cluster surveys, in addition to a fairly di- 

telescopes. rect connection of the observed quantities to model (mass- 
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Fig. 1. The spatial distribution (equatorial coordinates, Aitoff projection) of the 914 HRI pointings used for the 
BMW-HRI cluster survey. The origin of right ascension is in the center of the plot and the grid is in steps of B'^ 
in RA and 30° in DEC. The empty area corresponds to the \bii\ < 20° zone of avoidance, while the boxes mark 
the Magellanic clouds and Virgo cluster areas which were also excluded from the survey. Different symbols indicate 
different ranges of exposure time, as explained. 



specific) predictions, have one further, fundamental ad- 
vantage over optically-selected catalogues: their selection 
function is well-defined (being essentially that of a flux- 
limited sample) and fairly easy to reconstruct. This is a 
crucial feature when the goal is to use these samples for 
cosmological measurements that necessarily involve a pre- 
cise knowledge of the sampled volume, as is the case when 
computing first or second moments of the density field. 
X-ray surveys in the "local" Universe {{z) ~ 0.1), stem- 
ming from the ROSAT All-Sky Survey (RASS, Voges et 
al. 1999) have been able to pinpoint quantities like the 
cluster number density to high accuracy. The REFLEX 
survey, in particular, has yielded the currently most ac- 
curate measurement of the XLF (Bohringer et al. 2002; 
2004) , in substantial agreement with other local estimates 
(Ebeling et al. 1997; De Grandi et al. 2001). These result 
provide a robust z ^ reference frame to which surveys 
of distant clusters can be safely compared in search of 
evolution. 

Recent X ray sciarchcs for serendipitous high-redshift 
clusters have been based mostly on the deeper pointed im- 
ages collected with the ROSAT PSPC instrument (RDCS: 



Rosati et al. 1995; SHARC: Colhns et al. 1997; 160 Square 

Degrees: Vikhlinin et al. 1998a; WARPS: Perlman et al. 
2002) or on the high-exposure North Ecliptic Pole area of 
the RASS (NEP: Gioia et al. 2003); a deeper search for 
massive clusters in the overall RASS is also being carried 
out (MACS, Ebeling et al. 2001a). Results from these sur- 
veys consistently show a lack of evolution of the XLF-"^ for 
L < L*^ ~ 3 • 10'''* erg s~^ out to z ~ 0.8, pointing to 
low values of under reasonable assumptions on the 
evolution of the Lx — Tx relation (see Rosati et al. 2002 
for a review). At the same time, however, they confirm 
the early findings from the Einstein Medium Sensitivity 
Survey (EMSS, Gioia et al. 1990; Henry et al. 1992) of a 
mild evolution of the bright end (Vikhlinin ct al. 1998a; 
Nichol et al. 1999; Borgani ct al. 2001; Gioia et al. 2001; 
Mullis et al. 2004). In other words, there is an indica- 
tion that above z ^ 0.6 one finds less very massive clus- 
ters, likely indicating that beyond this epoch they were 

* All through this paper, we shall adopt a "concordance" 
cosmological model, with Ho = 70 km s~^ Mpc~^, Q,m = 0.3, 
= 0.7, and — unless specified — quote all X-ray fluxes and 
luminosities in the [0.5-2] keV band. 



Moretti et al.: The BMW-HRI Cluster Survey 



3 



still to be assembled from the merging of smaller mass 
units. These conclusions, however, are still based on a 
rather small number of high-rcdshift clusters. Currently, 
we know ~ 15 X-ray confirmed clusters above z = 0.8, 
with only 5 so far detected above 2; = 1, all of which 
are at z < 1.3. In addition, virtually all ciurcnt statis- 
tical samples of distant clusters have been selected from 
X-ray images collected with the same, low angular res- 
olution ROSAT-PSPC instrument. Clearly, the need for 
more samples of high-redshift clusters, possibly selected 
from independent X-ray imaging material, is compelling. 

As a first contribution in this direction, during the 
last four years we have constructed a new sample of 
distant cluster candidates based on the still unexplored 
ROSAT-HRI archive, the BMW-HRI Cluster Survey. 
While serendipitous searches are already focusing on the 
fresh data being accumulated in the archives of the new 
powerful X-ray satellites XMM- Newton and Chandra (see 
e.g. the pioneering work by Boschin 2002), our work shows 
that a remarkable source of high-redshift clusters — the 
HRI archive — has so far been neglected. In this paper we 
discuss in detail the selection process, the properties of 
this catalogue, its completeness function and contamina- 
tion. We then compute the sky coverage, number counts 
and expected redshift distribution of BMW-HRI clusters. 
The paper is organized as follows. Sect. 2 describes the 
general features of the HRI data and the field selection; 
Sect. 3 discusses the cluster detection and characteriza- 
tion; Sect. 4 presents the results of extensive Monte Carlo 
simulations, performed in order to understand the statisti- 
cal properties of the cluster sample, in particular its com- 
pleteness and contamination level; Sect. 5 uses all this in- 
formation to derive the survey sky-coverage as a function 
of source flux and extension, and to compute the survey 
expected redshift distribution and mean surface density; 
Sect. 6 presents a few examples of distant clusters already 
identified in the BMW-HRI survey, while the last section 
summarizes the main results obtained in the paper. 

2. Survey Description 

2.1. The HRI instrument 

The High- Resolution Imager (HRI) was the secondary in- 
strument on board the ROSAT satellite, with technical 
features quite different from and complementary to the 
companion PSPC, including in particular a much better 
spatial resolution. The core of the HRI is a micro-channel 
plate detector with an octagon-like shaped field of view 
(with ^ 19' radius) that reveals single X-ray photons, pro- 
viding information on their positions and arrival times. 
The HRI Point Spread Function (PSF) as measured on- 
axis is about 5" FWHM, i.e. a factor of ^ 4 better than 
the PSPC ^. On the other hand, the HRI is less efficient 
than the PSPC (a factor of 3 to 8 for a plausible range of 

^ For completeness, note that this is still a factor of ~ 5 worse 
than that of the Chandra X-ray Observatory, whose axehive 
represents a further source of high-resolution data that is cur- 
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Fig. 2. Distribution of exposure times for the 914 point- 
ings used for the cluster survey. 



incident spectra) and has a higher background. This con- 
sists of several components: the internal background due 
to the residual radioactivity of the detector (1-2 cts s^^), 
the externally-induced background from charged particles 
(1-10 cts s^^) and the X-ray background (0.5-2 cts s^^). 

The HRI covers the energy range [0.1-2.4] keV, divided 
into 16 Pulse Height Analyzer (PHA) channels, which 
provide very crude spectral information (Prestwich et al. 
1996). The HRI background is highest in the first few (1- 
3) PHA channels and, at variance with the ROSAT PSPC, 
it is dominated by the unvignetted particle background. 
As shown by sky calibration sources, most of the source 
photons instead arrive in a PHA channel between 3 and 
8 (David et al. 1998). A priori, the high background of 
the HRI can be a serious problem for cluster searches. 
The low surface brightness of these diffuse objects is in- 
deed further suppressed as a function of redshift, due to 
the cosmological dimming oc (1 + z)* and rapidly drops 
below the background. Coupled to the HRI lower sensi- 
tivity, this explains why all ROSAT serendipitous cluster 
searches have so far concentrated on the PSPC archive 
(see Sect. 1). We shall show here how these early, pes- 
simistic conclusions are fortunately not fully correct. As 
detailed in Campana et al. (1999, C99 hereafter), in order 
to minimize the background and increase the signal-to- 
noise (S/N) ratio of X-ray sources, especially low-surface- 
brightness ones, the BMW wavelet analysis has been re- 
stricted to PHA 2-9 (see also David et al. 1998). This 
range reduces the detector background by about 40% with 
a minimum loss of cosmic X-ray photons (< 10%; David 
et al. 1998). This is certainly one; key feature in improv- 
ing the detection of clusters of galaxies with these data, 



rently also under scrutiny using our algorithms (Romano et al.. 
in preparation) 
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Table 1. Regions around the LMC, SMC and Virgo clus- 
ter excluded from the survey. 



region RA DEC 



LMCl [3'' 52', 52'] 
LMC2 [5'' 24', 56'] 


[-77°, 


-63°] 


[-63°, 


-58°] 


LMC3 [6'' 52', 12'] 
SMCl [23'' 52', l'' 20'] 
SMC2 [23'' 46', 23*^ 54'] 
SMC3 [01'' 20', 02'' 00'] 


[-74°, 


-68°] 


[-77°, 


-67.5°] 


[-77°, 


-73°] 


[-72°, 


-67.5°] 


Virgo [12'' 20', 12'' 44'] 


[7.0°, 16°] 



thus allowing studies to reap the benefit of the excellent 
resolution of this instrument. 

2.2. The surveyed area 

The overall BMW-HRI serendipitous catalogue (Panzera 
et al. 2003, P03 hereafter; Lazzati et al. 1999, L99 here- 
after; C99) is based on the analysis of 4298 observations, 
i.e. the whole HRI archive after excluding calibration ob- 
servations, fields pointed at supernova remnants and some 
other problematic pointings. The selection of cluster can- 
didates imposes a number of extra selections, which are 
discussed here. We first selected only high Galactic lat- 
itude fields {\b\ > 20°) with exposure times larger than 
1000 seconds, excluding also the Magellanic Clouds and 
Virgo Cluster regions, defined in Tabled Moreover, we ex- 
cluded all pointings on star clusters, clusters and groups of 
galaxies and obviously Messier and NGC nearby galaxies. 
In all these cases the target tends to fill the field-of-view 
of the instrument, thus biasing any search for serendipi- 
tous sources in the surrounding area. For this reason, we 
further visually inspected all the remaining fields using 
panoramic (40' x 40') DSS2 images with the HRI X-ray 
contours overlaid, so as to ascertain whether the central 
target could in any way infiuence the searched region (as 
e.g. in the case of a spiral galaxy with faint arms extend- 
ing out of the central 3-arcmin radius area) . All fields with 
such "problematic" targets were excluded from the cluster 
candidate catalogue. 

One further problem needing a careful treatment con- 
cerns multiple observations of the same sky areas. All 
these multiple pointings have been analyzed separately 
by the BMW-HRI general survey: consequently, the cat- 
alogue lists every source detected in each single observa- 
tion, including multiple detections of the same source. To 
simplify the treatment of such cases, when multiple obser- 
vations looked "exactly" at the same region of sky, defined 
as images with nominal aim points separated by less than 
30", we discarded all but the deepest exposure. However, if 
the separation was larger than this, we kept all the (usu- 
ally overlapping) fields. The goal here was to maximize 
the survey area, which is the main factor in finding dis- 
tant luminous clusters. These overlaps have been properly 
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Fig. 3. Source extensions as measured by the BMW de- 
tection algorithm {Wdu, see text), plotted against the off- 
axis angle, showing our empirical definition of extended 
sources. The continuous line gives the mean extension of 
a point-like source at different off-axis angles, and the 
dashed lines identifies the 3 a locus. Sources selected as 
cluster candidates have to lie, with their error bars, above 
the dashed line and are indicated by filled circles. 

treated in the computation of the actual survey sky cov- 
erage, as explained in detail in Sect. 15. ll 

Within each selected field we then considered only 
sources detected in the area comprised between 3' and 
15' off-axis angle. This excludes the central part (< 3') 
of the field of view, which is normally where the target 
is pointed, and the very external part (> 15'), where the 
sensitivity drops and the PSF worsens. We ended up with 
a grand total of 914 HRI observations (Fig^l with expo- 
sure times ranging from 1 ks to 204 ks (FigI21) , for a total 
surveyed area of 160.4 square degrees, ~ 7% of which were 
observed twice or more. 

3. Selection of Cluster Candidates 

3.1. Source Extension Criterion 

The BMW detection is based on a wavelet transform 
(WT) algorithm for the automatic detection and char- 
acterization of sources in X-ray images. It is fully de- 
scribed in L99 and has been developed to analyze ROSAT 
HRI images, producing the BMW-HRI catalogue (C99, 
P03). Candidate sources are identified as local maxima 
above a given threshold in wavelet space; the preliminary 
product of the detection procedure is a source list with 
a rough determination of the position (the center of the 
pixel with higher coefficient), source size (the scale of the 
WT where the signal to noise is maximized) and total 
number of photons (the value of the maximum WT coeffi- 
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cient). Since the wavelet scale steps are discrete, the final 
catalogue parameters are a refinement of these values and 
are obtained through ^ minimization with respect to a 
Gaussian model in WT space (for all the details of the 
fit procedure in the WT space see L99). In particular, 
the BMW catalog source extension parameter (hereafter 
Wc;«) is defined as the width of the best-fitting Gaussian. 

The characterization procedure of extended sources 
has been explained in C99, but we briefly repeat it here 
for convenience of the reader. First, we considered a sub- 
sample of sources detected only in the observations that 
have a star as target (ROR number beginning with 2). 
This resulted in 6013 sources detected over 756 HRI fields 
(Fig.OJ. The distribution of source extensions was divided 
into bins of 1', as a function of the source off-axis angle, 
and within each bin, a cr— clipping algorithm on the source 
extension was applied: the mean and standard deviation in 
each bin are calculated and sources at more than 3 a above 
the mean value are discarded. In this way we determine 
the mean instrumental extension (Wp(0)) and standard 
deviation Op(ff) of point-like source observed at different 
Q. As shown in Fig.O this provides us with a threshold for 
defining truly extended sources (Rosati et al. 1995). We 
define as candidate clusters all sources that lie above the 
3ap(0) corridor, including their luciu error bar, i.e. those 
sources for which, at their observed off-axis angle 9, one 
has^ 

Wdu - (J,iu > {Wp{e)) +3ap{0) . (1) 

This combined requirement on the distance from the 
point-like source locus, and on the intrinsic error in the 
source extension, roughly corresponds to a ~ 3.5(7 confi- 
dence level for the extension classification. 

3.2. Significance Criterion 

To reduce the contamination level of the catalogue due to 
spurious detections, we also limit the current cluster se- 
lection to sources with detection significance larger than 
4(7 (see Sect. I4.3|l . The source significance Ps is defined 
as the confidence level at which a source is not a chance 
background fluctuation, given the background statistics 
and the specific field exposure time. This quantity is as- 
sessed via the signal to noise ratio in wavelet space. For 
each wavelet scale, the noise level is computed through 
numerical simulations of blank fields with the correspond- 
ing background (L99), while the signal is the peak of the 
wavelet coefficients corresponding to the source. In order 
to make this significance more easily comparable to other 
methods, confidence intervals are expressed in units of the 
standard deviation a for which a Gaussian distributed 
variable would give an equal probability of spurious de- 
tection (68%: la; 95%: 2a, etc.). So, the values of our 

^ For clarity, note that in the original BMW-HRI general 
catalog, a source was conservatively classified as extended if 
its extension Wdu and the relative error (adu) lay at more 
than 2 adu from this limit. 



figure of merit Ps will represent the number of ct's corre- 
sponding to the confidence level of that specific source. 
Note that the signal to noise in wavelet space can be dif- 
ferent from that in direct space for several reasons: i) the 
background subtraction is more accurate and locally per- 
formed; ii) the high frequencies are suppressed, so that a 
correlated count excess gives a higher significance than a 
random one with the same number of counts; iii) the ex- 
posure map can be incorporated in the wavelet space, so 
that artifacts do not affect the source significance. 

In the 914 HRI fields considered we have 194 sources 
meeting these requirements. Among these, we discovered 
that 22 were spurious sources caused by an hot pixel in 
the detector (not identified in the data reduction); this, 
coupled with the dithering of the satellite produces a high 
signal-to-noise extended source of 2' typical dimension al- 
ways located at the same detector coordinates. 

We inspected directly each of the remaining 172 
sources on DSS2/X-ray overlays and cross-correlated their 
positions with the NED database. As a result, we removed 
18 of them from the catalog as clearly associated with 
a nearby galaxy, thus ending up with a final list of 154 
cluster candidates. This represents our master statistical 
sample for follow-up. Clearly, even at ps < 4 cr a signif- 
icant number of sources are truly clusters. At this level, 
however, the contamination rises to ^ 40% (see Sect. l4.3|l . 
thus making the optical identification of real clusters sig- 
nificantly less efficient in terms of telescope time. 

3.3. Re-estimation of cluster parameters 

At the end of the analysis pipeline, the BMW detection 
algorithm yields a catalogue with 80 parameters for each 
source including the count rate, flux and extension (see 
P03 and Sect. I3.1|l . These values are reliable for point 
sources, but not for sources with a surface brightness pro- 
file (SBP) different from the instrumental PSF, as is the 
case for clusters. The observed cluster profiles are the re- 
sult of the convolution of the intrinsic SBP with the PSF of 
the instrument. The BMW algorithm calculates the source 
flux and extension by means of a minimization with 
respect to a Gaussian model in wavelet space (L99, see 
also previous section). Using a more appropriate (non- 
Gaussian) model within the pipeline is unpractical. First, 
in the general case a cluster model profile (e.g. a classi- 
cal /3-model or King profile, see Sect. ^ convolved with 
our Gaussian-based wavelet filter (L99) cannot be handled 
analytically, so that one of the simplifying features of the 
analysis pipeline is lost. Second, a direct profile fit to the 
data gives reliable results (in terms of ) only for the few 
most luminous objects in the sample, suggesting that an 
integral approach is the best way to proceed. 

For this reason, we adopted a simple and robust 
"growth-curve" technique, as applied to X-ray data by 
Bohringer et al. (2001), to re-measure the source total 
flux. Fig01 visually illustrates this technique. The source 
total flux is defined as the asymptotic value of the inte- 
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Fig. 4. Left panel: 24' x 24' portion of an HRI image (Gaussian smoothed with a — b") showing the detection of 
cluster candidate BMW044743. 4-202042, marked by the sohd 90" radius circle. The dashed circle (3' radius) indicates 
the central area containing the original HRI target (here barely visible), excluded from the cluster search. Right 
panel: Integral, background-subtracted flux growth curve of the candidate cluster (filled circles), plotted together 
with its 1 a error corridor (dotted hues) and the best-fitting /3-model (Grey dashed curve). The total fiux of the source 
is computed as the asymptotic value of the growth curve (horizontal dashed line). From the total flux, the core radius 
is then derived by comparison to a grid of (3 profiles convolved with the instrumental PSF at the specific off-axis angle. 
This is a typical source in our catalog, detected at ~ 7' off-axis with 75 net counts in a 10 ks exposure, corresponding 
to a flux of ^ 3 X 10~^^ erg s~^ cm~^. 



gral SEP, computed within increasing circular apertures 
on the background-subtracted images, when this reaches 
a horizontal "plateau" . The background-subtracted im- 
ages are obtained from the original images by the subtrac- 
tion of the background map (see Sect.^. Technically, the 
plateau is defined by studying the local derivative of the 
growth curve, comparing at each step the fiux variation 
within adjacent aperture radii to the expected mean error. 
To eliminate possible contamination from nearby sources, 
we first mask all other sources from the BMW-HRI gen- 
eral catalogue. Finally, we calculate the conversion from 
count-rates to fiuxes assuming a bremsstrahlung spectrum 
with temperature T = 5 keV and correcting for Galactic 
absorption using the appropriate column density at the 
source position (Dickey & Lockman 1990). Once the total 
flux of the source is established, we estimate its core radius 
by fitting its integral profile, adopting for the differential 
profile a classical /3-model (Cavaliere & Fusco-Femiano 
1976), with fixed /3 = 2/3 



I{r) = h 



,(2) 



where r in this case describes an azimuthal quantity, with 
Tc being the angular core radius. Choosing a fixed value for 
(3 is inevitable, given the low number of photons generally 
characterizing our sources, which does not allow a further 
free fit parameter. In this way, we obtain an estimate of 



the source extensions. This is a very important quantity, 
given that, as will be clear in the following, the statistical 
properties of the survey strongly depend on the extension 
and surface brightness values of the sources. As described 
in the next section, we assess these properties by means 
of Monte Carlo simulations: in these tests we coherently 
assume for the input simulated clusters the fixed slope 
/3 = 2/3 and variable core radii. 

The actual observed SEP is then described by the con- 
volution of the (5 profile with the instrumental PSF (which 
depends on the off-axis position). In practice, for each 
cluster candidate (i.e. total flux), we compute a numerical 
grid of expected SEPs for different values of the core ra- 
dius ranging from 2" to 120". We then measure the best 
fitting core radius by minimizing the between the ob- 
served and computed profiles. 

The uncertainties in the measurement of these param- 
eters are estimated via a further specific Monte-Carlo test, 
whose results are shown in Fig. The left panel of this 
figure shows that for sources with S/N> 4 our flux mea- 
surements are systematically under-estimated by ~ 12%. 
This is expected, as our fluxes are integrated out to a cer- 
tain radius, fixed by the growth-curve plateau. Ideally, the 
total fiux could be recovered by assuming a model profile 
and extrapolating it to infinity. We prefer not to do it, as 
this would be model-dependent and would result in larger 
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Fig. 5. Relative uncertainties in the flux (left) and core radius (right) estimation, at different signal-to-noise levels, 
derived from the Monte Carlo simulations of the cluster sample. They are calculated as the ratio (input-output)/input. 
The filled circles give the mean systematic relative error in each bin and its random standard deviation. 



errors (Vikhlinin et al. 1998a); rather, we include this un- 
certainty in the error budget. 

Fig. |31 also shows that, as one expects, the flux mea- 
surement errors depend on the source S/N ratio. We have 
therefore divided the cluster sample in S/N bins and as- 
signed to each source the corresponding relative error from 
the simulated sample. One may wonder why the behaviour 
of these errors is here studied as a function of S/N, rather 
than of the (almost equivalent) Ps confidence level defined 
in the previous section. The reason is that — unlike source 
detection — the measurement of flux and core radius is 
performed in real count space (not in wavelet space). 

The right panel of Fig. on the other hand, gives the 
typical errors in the estimates of cluster core radii. The 
way the values of f3 are chosen in the simulations requires 
some explanation, as it represents a subtle point. In fact, 
if one constructed input clusters with a fixed /3 = 2/3, 
the output of the simulation would describe the ampli- 
tude of statistical errors only (related to low S/N, etc.). 
We know that while /3 = 2/3 is a very good approxima- 
tion for most cases, clusters show a distribution of values 
around this (e.g. Ettori et al. 2004). This introduces an 
additional source of error when we try and measure the 
cluster core radius with our fixed f3 profile. In the simula- 
tion, therefore, /? was distributed according to a Gaussian 
with mean value 0.67 and standard deviation 0.05. With 
this choice, the relative error on Tc turns out to be ~ 20%, 
with negligible systematic errors as shown in Fig. [3 

The measured fluxes and core radii for all 154 sources 
in our list are plotted with their errors in Fig. |H1 The 
observed correlation between these two quantities is not 
surprising if clusters have an intrinsic distribution of sizes 
which is not fiat (e.g. Mohr et al. 2001; Arena 2002). One 




10-14 iQ-1?, 10-1^ 10-11 

flux [erg cin-^ s^i] 

Fig. 6. Angular sizes (core radii) versus measured X- 
ray fiuxes for the ps > A BMW-HRI candidate clusters. 
While an intrinsic correlation is expected between these 
two quantities, the slope observed here is probably infiu- 
enced by the lack of sources at high fiuxes, owing to the 
small volume sampled. 



should consider, however, that the lack of sources at high 
fiuxes, due to the small survey area, certainly contributes 
making the correlation steeper than it probably is intrin- 
sically. A discussion of these properties and of the distri- 
bution of angular sizes in the BMW-HRI catalogue goes 
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beyond the aims of this paper and will be discussed else- 
where (Arena et al., in preparation). 

The typical core radius of BMW-HRI cluster candi- 
dates measured from this figure is ~ 20". However, the 
high resolution of HRI data allows us to find a number 
of very small sources, characterized by core radii as small 
as ~5". We already know from our CCD optical follow 
up that several of these compact sources are not spuri- 
ous. Figd shows a CCD image in the Gunn-r filter (taken 
with EF0SC2 at the ESQ 3.6 m telescope) and the cor- 
responding HRI SEP of BMW145754.0-212458, a source 
with angular core radius 6"± 2". This candidate has been 
positively identified with a cD-dominated group at red- 
shift 0.312 and provides one key example of the specific 
advantage of the BMW-HRI catalogue with respect to 
previous PSPC-based surveys. Note that at the off-axis 
angle of this source (~ 6'), the instrument PSF has an 
half energy diameter of only 8". 

4. Monte-Carlo Simulations 

To quantify the limitations and assess possible systematic 
biases in the detection procedure, we performed extensive 
Monte Carlo simulations under realistic conditions. First, 
we embedded simulated clusters and point sources within 
realistic backgrounds and ran the detection pipeline, in 
order to evaluate the completeness and the characteriza- 
tion uncertainties of the catalogue. Second, we ran the 
detection procedure on simulated fields containing pure 
background, thus estimating the contamination of the cat- 
alogue due to spurious detections of background statistical 
fluctuations. In this section we describe the simulation re- 
sults in some detail. 
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Fig. 8. Comparison of the global pixel statistics of a real 
(black line) and a simulated image (Grey line), for a typi- 
cal exposure time of 20,000 sec. X-ray photons have been 
binned in pixels of 5" size. The histograms show how 
the background statistical properties of the simulated im- 
age accurately match those of the real one. The simulated 
background is built as a Poissonian realization of the back- 
ground map, and the differences that can be noticed for 
pixels with more than five counts are due to the effect of 
the simulated (extended) sources, which are overabundant 
in the simulated image. 



4.1. The simulated data 

Our simulated sample consists of 12 sets of HRI fields, 
each set characterized by a different exposure time evenly 
sampling (on a logarithmic scale, see the top-right panel 
of Fig. I10|l the range of the 914 observations which form 
the survey. Specific predictions for the whole set of actual 
exposure times are then obtained by interpolating through 
these 12 templates. 

For each template field we used the background maps 
built for the BMW general catalog as described in detail in 
Sect. 3.3 of C99. The procedure essentially uses the ESAS 
software (Snowden 1994), which produces a vignetted sky 
background map and a background detector map. The 
total background map is then obtained by summing the 
detector and sky maps. Finally, for each simulated field a 
Poissonian realization of such a background is built. Fig. 
IHl provides an example of how the simulated background 
pixel statistics reproduces the observed one in a corre- 
sponding HRI image. 

Point-like sources are simulated as pure PSF images 
with different normalizations, constructed by means of a 
ray-tracing technique (see C99), and their flux distribu- 
tion is generated so as to reproduce the observed X-ray 



number counts in the [0.5-2] keV band (Moretti et al. 
2003). In this way, we build a fully realistic X-ray sky, 
both in terms of background and surface density of point- 
like sources, in which to embed the simulated clusters. To 
construct these, we distribute counts according to a f3- 
model surface brightness profile (Eq.[21), with values of Tc 
randomly distributed in the range 5"-50", convolved with 
the PSF at the specific off-axis angle where the cluster is 
positioned. Fig.|51shows a visual comparison of a true clus- 
ter in the survey and a simulated one with the same core 
radius and integral flux. In each simulated image, we add 
three such extended sources, with fluxes randomly cho- 
sen in the range lO"^"* — 10~^^ erg sec~^ cm~^. All sources 
(both extended and point -like) are located at random po- 
sitions over the whole {9 — [3' — 15']) field of view. Overall, 
in a typical 20 ksec image this results in a mean number 
of 15 sources with more than 10 counts. The rather high 
number of simulated clusters allows us to reach a statis- 
tically signiflcant number of tests in a reasonable num- 
ber of simulations, still avoiding detection biases due to 
source confusion problems. To accumulate a statistically 
solid test sample, the procedure is repeated 1000 times for 
each of the 12 template exposure times. 
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Fig. 7. A specific example showing one of the major advantages of the BMW-HRI survey, i.e. its ability to detect as 
extended (and thus identify as groups/clusters) also sources with a very small core radius. This object, BMW145754.0- 
212458, has a core radius = 6"± 2 and would not be detected as extended in a PSPC image;. The optical follow-up 
confirmed this source as a real cD-dominated cluster at redshift at z = 0.312. In the left panel we show a CCD 
r — Gunn image, with HRI X-ray isophothes overlaid. The X-ray has been smoothed with a Gaussian filter (<t = 10") 
and the isophotes correspond to 1,1.3,1.7,2 standard deviations over the background (after smoothing). The right 
panel shows the measured growth curve (filled circles). The black solid line gives the integral best-fitting PSF- 
convolved /3-model, while the grey dashed line shows the instrumental PSF at the given off-axis angle. The horizontal 
dashed line defines the total flux of the source, as identified by the growth curve plateau. 



4.2. The catalogue completeness 

As discussed in the introduction, one of the advantages 
of X-ray surveys of clusters is the possibility to properly 
quantify their selection Junction Sp, i.e. the probability 
that a source with a given set of properties (e.g. flux, ex- 
tension) is detected in the survey. The selection function 
Sp fully characterizes the sample completeness and is the 
key ingredient for estimating its sky-coverage. 

In the ideal case of a uniform observation, given a value 
Her for the detection threshold in S/N ratio, the corre- 
sponding minimum detectable count rate (or flux limit) 
can be expressed by the following formula (see e.g. ROSAT 
Observer Guide 1992): 

_ Ugb^^'^dceH 1 , Cmin 

J^min — J. rr + , • V') 

J cell -y^exp '^exp 

Here dceii is the linear dimension of the detection cell 
fceii is the fraction of signal contained in the cell, b 
is the background count-rate and Cmin is the minimum 
number of source counts needed for detection when the 
background is negligible. However, in the low S/N regime 
where the source flux is comparable to the background 
noise, the concept of flux limit becomes somehow arbi- 

Note, as an aside, that in our case the wavelet transform, 
by its very nature, maximizes the S/N ratio for each source by 
optimizing the choice of the detection cell dimension. 



trary: sources with fluxes lower than the theoretical sen- 
sitivity limit can be detected with non-null probability. 
This is due to the fact that faint sources can be detected 
if they sit on a positive background fluctuation, while they 
are missed in the opposite case. For this reason, the con- 
cept of a sharp flux limit needs to be generalized with 
a statistical approach, in which the selection function de- 
scribes the probability for a source with given properties to 
be detected. Given the characteristics of X ray telescopes 
and speciflcally of the HRI data, this probability depends 
in general on: i) the exposure time of the observation ; ii) 
the source position within the field of view: moving from 
the center to the edges of the images, the X-ray mirror 
effective area decreases and the spatial resolution worsens; 
iii) the source extension. 

Assuming that extended sources can be sufficiently 
well described by the same /3-model profile with different 
core radii, the whole cluster survey can be characterized 
by a selection function Sp = Sp{f^9,tExp,rc), where / is 
the flux, 6 is the off-axis angle, rc is the apparent extension 
of the source and texp the exposure time of the observa- 
tion. Again, the only safe way to estimate the value of Sp 
within this multi-parameter space is by means of Monte 
Carlo simulations. 

To this end, we flrst grouped the input simulated 
sources into a grid deflned by 3 different angular extension 
ranges, r^ = [5" - 20"], r^ = [20" - 35"], r^ = [35" - 50"], 
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Fig. 9. Top panel: visual comparison of the HRI image 
of BMW000141. 3-154042 (left), the cluster detected with 
the largest number of net counts in the BMW-HRI survey, 
with a simulated X-ray source with the same counts and 
core radius. The bottom panel plots the average of the 
central 10 columns of the two images, with the real cluster 
described by the black line. 

together with 20 logarithmic bins in count rate. We then 
considered in each of the 12 simulated sets of images, 3 dif- 
ferent radial areas, defined by the annuli between [3' — 9'], 
[9' — 12'], and [12' — 15'] off-axis angles. At this point, we 
could compute the actual values of the selection function 
Sphy measuring the ratio of the number of detected to the 
number of input simulated sources within each bin of the 
4-dimensional hyper-space defined by exposure time te^p, 
fiux /, core radius and off- axis angle 6. The four panels 
in Fig. ^1 summarize the results of this procedure. The 
top-left panel considers a simulated field with t^xp — 40 
ksec and the annulus between 9' and 12'. The points and 
curves show how the "projected" Sp for different source 
extensions and as a function of flux is well fitted by a 
Fermi-Dirac function: 

SF{f,e,r,)^ ^ (4) 
e <= +1 

where /50 and c are the two free parameters of the func- 
tional form. The parameter /50 corresponds to the flux 
where Sp equals 0.5, or in other words, the detection prob- 
ability equals 50%, while c simply describes how sharp the 
cut-off in the selection function is. 

Considering different exposure times and off-axis an- 
gles, one gets analogous plots, with different values of the 
pair (/5o,c). By constructing a 3-dimensional grid of the 
best fit (/so, c) values over (a) 12 different exposure times, 



(b) 3 off-axis angles and (c) 4 core radii, we then obtain 
the general expression of the selection function Sp over 
the whole parameter space (see panels in Fig. I10|l . In this 
way, for a source with a given core radius within the range 
5"-50", we have the detection probability for an arbitrary 
exposure time observation (ranging from 1 ksec to 200 
ksec ), at an arbitrary position within the field of view 
(3'< 9 < 15') and for an arbitrary fiux (ranging from 
10""'^^ to 10~^^ erg sec~^ cm~^). For the few clusters with 
core radius larger than 50", Sp is safely obtained by lin- 
ear extrapolation, according to the bottom-right panel of 
Fig.Hni 

4.3. The catalogue contamination 

By construction, the detection threshold in the overall 
BMW HRI source catalogue is set by fixing the expected 
number of spurious sources: the higher the number of al- 
lowed spurious sources, the lower the threshold. This num- 
ber is set to 1 in ten fields for each scale of the wavelet 
transform, which corresponds to 0.4 expected spurious 
sources in each frame. This, however, should be taken 
as a theoretical estimate because it assumes a perfectly 
flat background so as to provide a general instrument- 
independent reference (L99). Here, we derive a more ac- 
curate estimate of the number of spurious detections by 
properly simulating a realistic background. Moreover, due 
to the empirical definition of extended sources, we cannot 
analytically predict the number of spurious sources that 
will be detected as extended and need to resort again to a 
Monte Carlo test. To this end, we build pure background 
images for the usual set of 12 templates with different ex- 
posure times (and different background values) ranging 
from 1 ksec to 200 ksec (see Sect. 0J. This time, we do 
not add any simulated source. For each of the 12 tem- 
plate images, we generate 200 different realizations of the 
background map, assuming a pure Poissonian noise. Then 
we run the BMW detection pipeline: clearly, all detected 
sources will be spurious, resulting from background fluc- 
tuations above the detection threshold. 

Within the 2400 simulated blank frames, we detect 
844 spurious sources (both point-like and extended), to 
be compared with the expected value of 0.4 x 2400 = 960. 
The close agreement between these two figures represents 
an encouraging a posteriori confirmation that the total 
number of spurious detections is well under control in the 
whole selection procedure. Among these, 108 sources meet 
the requirements of our cluster survey (extension and sig- 
nificance), providing us with an estimate of 0.045±0.003 
spurious Ps > 4 (7 detections in each frame of the sur- 
vey. Note that this value does not depend on the specific 
exposure time of the field, as in each field the detection 
threshold is pushed down only to the appropriate level so 
as to keep the total number of spurious detections con- 
stant. Given the 914 fields composing our survey, we thus 
expect 38±3 spurious sources to contaminate the sample 
of 154 cluster candidates (25%). Note that the expected 
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Fig. 10. Dependence of the selection function Sp on the relevant observational and source parameters, as obtained 
from the Monte Carlo simulations. Sp is defined as the ratio between the number of detected over the number of 
simulated input sources, as explained in the text. In the top-left panel we plot the results for the annulus between 
off-axis angle 9' < 9 < 12', texp = 40 ksec and 4 different SB profiles. The points are very well fitted by a Fermi-Dirac 
function with different values of the two free parameters (Eq. EJ. In the top-right panel for all 12 sets of simulated 
fields, we plot the values of /50 (in counts sec~^) for the region 9' < 9 < 12' as a function of exposure time. In the 
bottom-left panel we plot /50 as a function of the off-axis angle for a 5 ksec exposure and for four different source 
extensions. In the bottom-right panel we plot /50 as a function of source core radii, for a 30 ksec observation in the 
annulus between 3' and 10'. /50 turns out to depend linearly also on the value of r^. In all four plots the dashed lines 
reproduce the multidimensional fit as explained in the text. 



number of spurious detections is proportional to the num- 
ber of the analyzed fields, not related to the number of 
sources in the catalog. This means that, for example, if 
one concentrated only on a subset of high-exposure fields, 
the number of spurious sources would go down faster than 
the number of true clusters. However, the explored area 
and volume would also be reduced accordingly, thus af- 



fecting the ability to find hmrinous, rare clusters, which is 
one of our goals. 

There is a second source of contamination that needs to 
be considered, which is due to point-like sources mistaken 
as extended. This fraction is readily estimated from our 
first set of simulations, where we realistically simulated 
the observed sky in terms of surface density distribution 
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Fig. 11. The expected number of spurious detections in 
the BMW-HRI cluster catalogue as a function of the 
source significance, expressed in units of standard devi- 
ations a (see text for details) . The curve includes the con- 
tamination produced both by background fluctuations de- 
tected as true (extended) sources and by real point sources 
mistaken as extended by the algorithm. On the basis of 
this curve, the BMW-HRI > 4 master sample is ex- 
pected to have a 27% contamination, while selecting at 
Ps > 5 one obtains a sample with virtually null contami- 
nation. 

of point-like sources. The net result is that in the sample of 
154 Ps > cluster candidates, we expect 4±2 sources to 
be false classifications of a point-like source, which makes 
up for a total contamination of 42 ± 4, i.e. 27%. 

As shown by Fig. 1111 the number of total expected 
spurious detections (background fluctuations plus wrongly 
classified point-like sources), decreases significantly at 
higher detection significance: it is more than 40% at 3 a, 
27% at 4(7 and negligible beyond 5 a. The chosen signif- 
icance threshold at 4 cr for our main cluster catalogue is 
based on this plot, representing a good compromise be- 
tween the total number of sources and the expected con- 
tamination. 

5. Sky Coverage and Number Counts 

5.1. The BMW-HRI Cluster Survey Sky-coverage 

The sky-coverage il (SC hereafter), measures the actual 
surveyed area of the survey as a function of X-ray flux 
and of the other parameters characterizing each source: it 
is a necessary ingredient for any statistical computation 
involving the mean surface density of clusters in the cata- 
logue, like e.g. the log N -log S or the luminosity function. 

By dcflnition, it is closely related to the survey selec- 
tion function, which, as we have seen, depends also on the 



Fig. 12. The solid angle covered by the BMW-HRI survey 
(sky-coverage) as a function of flux and source extension, 
obtained by integrating the survey selection function over 
the observed flelds. 

off-axis angle 6 and the source size Tc (see Sect. I4.3|> . Let 
us therefore consider an infinitesimal annulus at an off- 
axis angle 0, characterized by a solid angle dto, within the 
field of view of a generic observation i. The area of this 
elementary surface will contribute differently to the to- 
tal sky coverage, depending on the fiux limit one chooses 
and on the source size: for example, depending on the 
exposure time of the observation, it will provide its full 
area at a given bright fiux, while it will be "invisible" 
at fainter fluxes. Or, equivalently, it will be effective for 
flnding very small sources, and become ineffective for very 
large, blurred sources. All these effects are already taken 
into account by the selection function Sp that we have 
carefully estimated through our Monte Carlo simulations. 
In fact, by deflnition, the actual contribution of this an- 
nulus to the total sky coverage will be 

dn, = dn,{f,d,rc)^SFAf,d,r,)dLO , (5) 

and the total SC yielded by that given fleld i for a source 
with flux / and size Vc will then be simply the sum over 
the annuli, i.e. 

f^.(/,rc) (/,0,r,)dc.(0) , (6) 

where in our case 9rnin = 3' and Omax = 15'. Finally, the 
overall SC of the survey, ^BAiwi f, Tc) will be given by the 
sum of the contributions by each single observation: 

^BMwif.r,) = Y.^^ = Y. SFif,e,r,)divie) ,(7) 

which shows explicitly the dependence of the total sky 
coverage on the source extension Tc fFig ll2|) . Thus, given 



Moretti et al.; The BMW-HRI Cluster Survey 



13 




10-14 lQ-13 IQ-IZ 

[0.5-2.0 keV] (erg s-' cm-=) 



Fig. 13. Comparison of the BMW-HRI size- weighted sky 
coverage to those of the EMSS (Gioia et al. 1990), 160 
Square Degrees (Vikhlinin et al. 1998a) and RDCS (Rosati 
et al. 1998). Since the sky coverage is a function of both 
flux and source extension, the BMW-HRI curve has been 
computed combining the sky coverages pertaining to dif- 
ferent source extensions, according to the observed distri- 
bution of angular sizes in the > 4 sample. 



5.2. Accounting for overlapping fields 

As anticipated in Sect. 12.21 7% of the survey area was 
observed more than once. Therefore, when summing the 
areas contributed by each field in the sky coverage calcu- 
lation we need to account for this repeated area so as to 
include it only once. To this end, let us consider a given 
source with flux /, which has been observed N times with 
different exposure times and therefore with different values 
of the selection function. In other words, the probability Pi 
of detecting this source is different in each exposure i, co- 
inciding with the value SpiOf the i-th selection function at 
flux /. Accordingly, the global selection function for that 
specific source SFif,rc) in the N observations will corre- 
spond to the probability of detecting the source within at 
least one of the N observations. If we consider the proba- 
bility Q of not detecting it in any of the observations, this 
is nothing else than the product of the probabilities of not 
detecting it in each image 

g = (1 - Pi) X (1 - P2) X ...(1 - Pjv) = - 5fJ .(8) 

i 

At this point, the selection function pertaining to that 
source, Tc) will be just the complement of Q, i.e. 

Sf^I-1[{1-Sf,) . (9) 



an observed source characterized by an (/, Tc) pair, it will 
always be possible to give a value for the total solid an- 
gle over which that source could have been found in the 
survey. 

Given the various dependences we have discussed, a 
comparison of the sky coverages as a function of X-ray 
flux among different surveys can only be done in an ap- 
proximate way, i.e. considering a "typical" sky coverage 
for each specific survey (e.g. Rosati et al. 2002). To per- 
form such a comparison, given that the source extensions 
correlate with their fluxes (Fig.lBJ, we tried and obtain a 
sufficiently realistic one-dimensional ^{f) by computing 
the mean Tc values within 5 bins in flux, and then con- 
structing a "composite" ID sky coverage which in each 
flux range reflects the typical size of sources with that 
mean flux. The result is shown in Fig. 1131 compared to 
the sky coverages of three other representative surveys 
from the literature. This plots shows that the BMW-HRI 
survey is competitive with existing PSPC surveys both at 
bright and faint fluxes. In particular, it provides an inter- 
esting solid angle coverage below 6x 10""'^'^ erg sec^^ cm~^, 
which results in the potential ability to detect a few clus- 
ters beyond z ^ 0.9—1. In fact, among the few BMW-HRI 
clusters with confirmed spectroscopic redshift, we already 
have two objects at z = 0.89 and z ~ 0.92. We remark 
that these values are specific for the low-contamination 
Ps > 4:a sample. In Sect. 15.31 we shall use this ID sky 
coverage to compute the expected redshift distribution of 
BMW-HRI clusters. 



5.3. The expected redshift distribution 

Using the computed sky coverage, we can now address one 
of the most interesting aspects of a deep cluster survey, 
i.e. the expected redshift distribution of the sample. In 
particular, it is of interest to ask how many clusters we 
expect in the cosmologically interesting range z ~ 1, if 
any, in different evolutionary scenarios. 

Briefly, the differential redshift distribution is obtained 
in the standard way as 

dn dn dV dn dV dl 

'dl dy~d^ ^ dy~dfd^ ' ^'^'^^ 
where dl/dz is the cosmological comoving line element, 
dV/dl — d\duj, dA is the angular size distance and doj 
is the elementary solid angle covered (e.g. Peebles 1993, 
pag. 331). The sky coverage enters here when we integrate 
over the whole observed solid angle, while dn/dV, i.e. the 
number of clusters per unit volume expected at the same 
z, is obtained by integrating the XLF from the minimum 
allowed luminosity (at that z) to infinity, 

/ TT^dLx- / HLx)dLx ,(11) 

where (j){Lx) is very well described by the usual Schechter 
functional form, yielding 

We therefore used the XLF parameters for the [0.5-2] keV 
band measured by the REFLEX survey (Bohringer et al. 
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2002), re-computed for a lambda-cosmology by MuUis et 
al. (2004), (j)* = 8.56 x IQ-^'h^ Mpc-^ L% = 1.295 
erg sec"-^, a = —1.69, to obtain the solid curves shown 
in Fig. El under the hypothesis of a non-evolving XLF. 
Rosati et al. (2000) first used a Maximum-Likelihood ap- 
proach to quantify in a phenomenological way the evo- 
lution observed in the XLF measured from the RDCS^. 
To this end, they parameterized evolution in density and 
luminosity through a simple power-law model, in which 

0*(z) = 0*(l + z)-^ , (13) 

L\{z) = Ll,{l + zf (14) 

and 4>Q, are the XLF parameter values at the cur- 
rent epoch. From their analysis of the RDCS survey they 
estimated A ~ —1.2, B = —2, indicating a statistically 
significant evolution in the mean density of luminous clus- 
ters. The same model has recently been applied to the 
160SD survey data by MuUis et al. (2004), where also the 
EMSS, NEP and RDCS constraints are added. By com- 
paring all these available estimates, they suggests that the 
best fitting values for the parameters A and B are A ~ 0, 
B ~ —2.5. These are the values we adopted for our compu- 
tations of the expected number of clusters in the BMW- 
HRI survey with evolution of the XLF. 

The overall results, with and without evolution, are 
compared in Fig. E| The first interesting point to no- 
tice concerns the total number of clusters predicted in the 
two scenarios. We have a prediction of a total of 89 to 132 
clusters in the Ps > 4: sample, depending on evolution. Our 
current number of candidates is 154, with a prediction of 
42 being spurious due mostly to background fluctuations. 
This yields a number of 112 true clusters in the survey, 
which is right between the predictions of the two curves. 
This is interesting, as it could imply that less evolution 
of the XLF is seen in the BMW-HRI sample. However, 
note that the simple (A,B) evolutionary model has been 
applied in the original form (Borgani et al. 2001), which, 
taken at face value, while reproducing correctly the ob- 
served deficit of luminous clusters above z ^ 0.6, also 
under-predicts the number of intermediate-redshift sys- 
tems, as can be seen from the left panel of Fig. where 
the two curves already begin to separate at at redshifts as 
low as z = 0.15, where we know that no evolution whatso- 
ever is observed. One possibility is to change slightly the 
Borgani et al. functional form describing the evolution of 
the XLF, as done by MuUis et al. (2004), by introducing an 
effective non-zero reference redshift in Eq. (|13|l and (|14|l 
which shifts the "switch-on" of evolution to larger, more 
realistic redshifts. 

The second interesting point to be appreciated from 
the figure is the significant number of high redshift sys- 
tems that the BMW-HRI survey should be capable of 
detecting. Even with the strongest evolution of the XLF 

^ A similar ML approach was used in a more physical way to 
estimate the values of SIm and ag from the observed evolution 
(Borgani et al. 2001) 



we expect 3 clusters with z > 0.8, with the number ris- 
ing quickly to ^ 10 if milder evolution is admitted. In 
fact, we already have three confirmed clusters in this range 
(see Sect. I^l, with a handful of further confirmed candi- 
dates with photometric redshift > 0.7. One of these is 
an extremely promising object showing a concentration 
of galaxies with r — Ks = [5.5 — 6] around the X-ray 
source position, with a range of colours typical of early- 
type galaxies at z ^ 1.2 — 1.3 (see e.g. Stanford 1997). 

5.4. The LogN - -LogS 

The cluster number counts (historically known also as the 
''LogN LogS" distribution) are the simplest diagnos- 
tic of cosmology /evolution that can be obtained from a 
sample of cosmological objects, without knowing their dis- 
tances. It also represents a basic test for the quality of the 
sample and/or the reliability of the sky coverage, being 
sensitive to residual biases and contaminations as a func- 
tion of flux. 

To compute the integral flux distribution, we weight 
each source by the inverse of the total area in the survey 
over which the source could have been detected. This is 
nothing else than the inverse of the sky-coverage value at 
the specific source flux and extension: 

"(>^) = Eff ■ (^^^ 

f>s f 

We used this formula and the estimated sky coverage to 
compute the number counts using a "high-purity" sample 
with Ps > 5 (7, containing 45 candidates, for which negligi- 
ble contamination is expected fFig. Ill|l. After estimating 
the appropriate sky coverage for this selection we com- 
puted the points reported in Fig^] We chose to bin the 
counts so as to have an equal increase AiV = 5 in the 
number of sources included in each bin, when moving to 
fainter and fainter fluxes. The error bars also include the 
uncertainty in the measured Tc, on which the sky coverage 
for each source depends. Our result is compared to simi- 
lar measurements from the RDCS (Rosati et al. 1998) and 
160SD (Vikhlinin et al. 1998a) surveys, and to the predic- 
tions for an unevolving and evolving XLF in the adopted 
Ho = 70 km s"^ Mpc-\ = 0.3, f^A = 0.7 cosmol- 
ogy. The XLF evolution is described using the same phe- 
nomenological (A, B) model used in the previous section, 
here using both the original parameters by Rosati et al. 
(2000, dotted line) and the milder values by MuUis et al. 
(2004, dashed line). We also note that the no-evolution 
curve is slightly higher than the one shown in Rosati et 
al. (2002), due to the different XLF parameters used here 
(REFLEX vs. BCS). The main resuU from this plot is that 
the BMW-HRI and PSPC data agree very well with each 
other and are globally consistent with a moderate evolu- 
tion of the XLF. The observed counts can be taken first 
of all as an a posteriori confirmation of the predicted low 
level of contamination of the > 5 u sample, and of the 
self-consistency of the computed sky coverage. More sub- 
stantially, the agreement with the other surveys indicates 
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Fig. 14. The expected redshift distribution of the BMW-HRI cluster sample, in differential form (left), showing 
the number of clusters expected in bins of Az = 0.1, and in integral form (right), showing the number of clusters 
expected above a given redshift z. The dashed curves in both panels refer to an empirical evolution model for the 
XLF as defined by Rosati et al. (2000), using a conservative pair of parameters (A=0,B=-2.5), which best describe 
the overall behaviour of available PSPC surveys (MuUis et al. 2004). 



that the bulk of the (faint) cluster population is consis- 
tently detected by both PSPC- and HRI-based surveys. 
However, one should keep in mind also the limitations of 
this kind of plot. In fact, the shape and amplitude of the 
logN-LogS is mainly sensitive to the faint /intermediate 
range of the XLF (dominated by low-redshift groups and 
clusters), while it is rather insensitive to changes in the 
number of rare, high-luminosity clusters at high redshift. 

6. Early follow-up results 

To provide a first hint of the general optical properties 
of BMW-HRI clusters, we briefly summarize here some 
early results from the ongoing follow-up campaign. Our 
main cluster identification strategy is currently based 
on multi-band imaging in the optical g, r, i bands and 
near-infrared J, H, Kg bands, with at least two, typically 
three of these bands secured for each target. A cluster is 
considered as confirmed when a significant over-density 
of galaxies with coherent colours is detected in the area 
of the X-ray source. The current identification statistics, 
based on a total sample of 119 candidates observed so 
far, gives 83 confirmed, 19 rejected and 17 still uncertain 
clusters, in substantial agreement with the contamination 
level estimated in this paper. Photometric redshifts for 
these objects are also estimated cither approximately 
from the mean colour of the cluster red sequence (when 
present), or more accurately using the photo-z codes 
of Fernandez-Soto et al. (2001), and Bolzonella et al. 
(2000) when 3 or more bands are available. A few of these 
clusters have also been observed spcctroscopically by us 
or have been found to be in common with other surveys. 



We cross-correlated our list with the latest version of 
the 160SD survey, with the unpublished RDCS catalogue 
and with the SHARC (Romer et al. 2000) and WARPS 
(Perlman et al. 2002) published lists. This comparison 
to PSPC-based surveys, performed early in the project, 
provided us with encouraging confirmations, indicating 
that the survey had the potential to efficiently peer 
into the high-redshift Universe. Common high-redshift 
clusters include, for example: • BMW122657.3+333253 
at z = 0.888, first discovered by Cagnoni et al. (2001) in 
the WGA survey and by Ebehng et al. (2001b) in the 
WARPS survey; • BMW052215. 8-362452 at z = 0.53, 
also found in the 160SD survey (MuUis et al. 2003); 
• two (unpublished) RDCS clusters at z = 0.64 and 
z = 0.808 (Rosati, private communication). Our cur- 
rent list of new high-redshift clusters includes another 
7 confirmed objects reliably located at Zphot > 0.6 
by r-i-Ks photometric redshifts. A few of these are 
shown in Fig. ^| These clusters will be the subject 
of specific future papers. X-ray/optical overlays and 
RGB images for more BMW-HRI clusters can be seen at 
\http:/ / www.merate.mi.astro.it/^ guzzo/ BMW/ g allery.htm\ 

7. Summary and Conclusions 

We have presented in detail the construction of the BMW- 
HRI Cluster Survey, a new sample of X-ray selected clus- 
ter candidates drawn from the so-far poorly explored 
ROSAT HRI archive. We have shown that by selecting 
sources with detection significance larger than 4(T and 
extension significance better than ~ 3.5 tr, one can ob- 
tain a reliable sample of 154 cluster candidates with flux 
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Fig. 15. The classical LogN LogS plot — i.e. the mean cumulative surface density of clusters as a function of 

X-ray flux — from the BMW-HRI survey (filled circles), compared to the RDCS (Rosati et al. 1998) and 160SD 
survey (Vikhlinin ct al. 1998b). The BMW-HRI points arc computed from the high-significance > 5o" sub-sample, 
for which the expected contamination is negligible. The curves give the expected counts in the same A-cosmology 
adopted in this paper, for the cases of no-evolution and evolution of the XLF. The BMW-HRI and PSPC data are all 
consistent with a negligible to moderate evolution of the XLF. 



larger than 2xl0~^^ erg sec~^ cm~^ and very interest- 
ing properties. We have performed extensive Monte Carlo 
simulations to recover the survey selection function with 
respect to the major parameters affecting the source de- 
tection. These have allowed us to reconstruct a sky cov- 
erage which is competitive with existing PSPC surveys 
over the whole range of fluxes, with a particularly inter- 
esting solid angle covered in the faintest bins. These re- 
sults show that, contrary to expectations, the HRI data 
— once they have been treated as discussed in C99, i.e. 
eliminating the noisiest energy channels — allow us to de- 
tect low-surface-brightness objects out to high redshifts, 
despite their higher instrumental background. We also es- 
timated the expected sample contamination due to back- 
ground fluctuations and false classifications, which turns 
out to be 27% for this significance threshold. 

We have used a high purity sample selected at ps > 5 cr, 
containing 45 objects expected to be virtually all true clus- 
ters, to estimate the cluster number counts in the range 
3x10"^'' to ~ lxlO~^^ erg sec~^ cm~^. This measure- 
ment is in very good agreement with previous estimates 
from the RDCS and 160SD surveys. 



Possibly the most important aspect of the BMW-HRI 
cluster survey is that it will be the first large sample of 
clusters to bc! drawn from an instrument independent from 
and with higher resolution than the ROSAT PSPC, on 
which virtually all serendipitous cluster surveys so far have 
been based^. This means that the BMW HRI survey po- 
tentially includes some types of object which could have 
been missed in previous surveys. It is natural to expect 
that these are mostly small groups, which the HRI is able 
to detect as extended. Indeed, we have specific examples 
in the survey of objects with core radii as small as 6". The 



Notable recent exceptions are represented by new samples 
being constructed using Chandra and XMM-Newton data, as 
done respectively by Boschin (2002) and by the XMM-LSS 
survey (Valtchanov et al. 2003). In the first case, an accurate 
study of the survey selection function, sky coverage and pre- 
dicted redshift distribution has been provided. Unfortunately, 
no deep follow-up is being done for this survey, where a sig- 
nificant fraction of clusters with z > 1 is expected. In the 
case of the XMM-LSS sample, on the other hand, two « ~ 1 
clusters have recently been discovered (Andreon, priv. comm.). 
However, no quantitative statistical description of the survey 
sky coverage is available yet. 
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Fig. 16. Optical/X-ray overlays for a selectfon of new BMW-HRI groups/clusters at different redshifts. From top left 
to bottom right: BMW134654.9-301328, z^pec = 0.358; BMW212415.7-334754, z^pec = 0.92; BMW122842.6-391612, 
Zphot ^ 0.8; BMW112059.0+130450, z^pec = 0.615. The redshift for BMW212415.7-334754, a very compact cluster, 
awaits further confirmation, being based on only 3 redshifts and displaying a second system at z = 0.62. The X-ray 
images have been smoothed with a Gaussian filter {a = 40") and the isophotes correspond to 0.7,1,2,3 standard 
deviations over the background (after smoothing). Optical images are combined Gunn-r + z ~ 60min total exposures 
with EF0SC2 at the ESO 3.6 m telescope. For all but the first object the sides of the figures are 2.5'. 



observed agreement among our number counts and those 
from the RDCS and 160SD surveys, however, indicates 
that the percentage of these small-sized objects does not 
seem to be sufficient to represent a substantial incomplete- 
ness in PSPC surveys. A finer assessment of possible dif- 
ferences will be provided once the redshifts for BMW-HRI 
clusters are measured, both by comparison of the faint end 
slopes of the cluster XLF and of the number of clusters 
detected above z ^ 0.8. While the faint end of the num- 
ber counts is dominated by low-redshift, low-luminosity 
objects, the HRI resolution should begin to make a differ- 
ence when one goes to high redshifts. The work of Ettori et 



al. (2004), based on Chandra observations, seems to indi- 
cate that z > 0.8 clusters have on average a more compact 
profile than their lowcr-redshift counterparts. Keeping in 
mind that all our simulations arc based on the necessary 
but crude approximation of a /3 = 2/3 profile, these find- 
ings would go in the direction of increasing the probability 
of detecting clusters at high redshift, favored by the HRI 
high spatial resolution. This would probably be too mild 
to show a significant effect in the integral number counts, 
yet might increase the number of detectable z ~ 1 clusters 
by a significant factor. 
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